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Figure 1 : a/ Map of plasrnid pRIT14586 



BamHY 




B/ Coding sequence of the first 127 amino acids 
of protein D and multiple cloning site. The signal 
sequence is underlined. 

BamHI 

ATG GAT CCA AAA ACT TTA GCC CTT TCT TTA TTA GCA GCT GGC GTA CTA GCA GOT TGT AGC AGC 

Met Asp Pro Lys Thr Leu Ala Leu Scr Leu Leu Ala Ala Gly Val Leu Ala Gly Cys Scr Ser 

CAT TCA TCA AAT ATG GCG AAT ACC CAA ATG AAA TCA G AC AAA ATC ATT ATT GCT CAC CGT GGT 

His Ser Ser Asn Met Ala Asn Thr Gin Met Lys Ser Asp Lys Ue He Ue Ala His Arg Gly 

GCT AGC GGT TAT TTA CCA GAG CAT ACG TTA GAA TCT AAA GCA CTT GCT TTT GCA CAA CAG GCT 

Ala Ser Gly Tyr Leu Pro Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala 

GAT TAT TTA GAG CAA GAT TTA GCA ATG ACT AAG GAT GGT CGT TTA GTG GTT ATT CAC GAT CAC 

Asp Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val He His Asp His 

TTT TTA GAT GGC TTG ACT GAT GTT GCG AAA AAA TTC CCA CAT CGT CAT CGT AAA GAT GGC CGT 

Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe Pro His Arg His Arg Lys Asp Gly Arg 

TAC TAT GTC ATC G AC TTT ACC TTA AAA GAA ATT CAA AGT TTA GAA ATG ACA GAA AAC TTT GAA 

Tyr Tyr Val Ue Asp Phe Thr Leu Lys Glu He Gin Ser Leu Glu Met Thr Glu Asn Phe Glu 

Ncol Spel Xbal 

ACC ATG GCC ACG TGT GAT CAG AGC TCA ACT AGT GGA CAC CAT CAC CAT CAC CAT TAA TCT AGA 

Thr Met Ala Thr Cys Asp Gin Ser Ser Thr Ser Gly His His His His His His • 



The amino acid sequence of Figure 1 relates to Seq. ID no. 7 and the nucleic acid sequence of 
Figure 1 relates to Seq. ID. No. 6. 
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The DNA and amino acid sequences of Nef-His; Tat-His; Nef-Tat-His fusion and 
mutated Tat is illustrated. 

Pichia-expressed constructs (plain constructs) 
=> Ne f-HIS 

DNA sequence (Seq. ID. No. 8) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGA 
ATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAA 
AAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGG 
CT AGAAGCACAAGAGGAGGAGGAGGTGGGTTTT C CAGTCACACCTCAGGTACCTTTA 
AGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGG 
GGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATC 
TACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTC 
AGAT AT C CACTGACCTTTGGATGGTGCTAC AAGCTAGT AC CAGTTGAGC CAGAT AAG 
GTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCAT 
GGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCA 
TTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGC 
CACCATCACCATCACCATTAA 

Protein sequence(Seq. ID. No. 9) 



MGGKWS KS S WGWPTVRERMRRAE P AADGVGAAS RDLEKHGA I T S S NT AATN AACAW 
LEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWI 
YHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLH 
GMDDPEREVXjEWRFDSRLAFHHVARELHPEYFKNCTSGHHHHHH . 



^> Tat -HIS 

DNA sequence (Seq. ID. No. 10) 



ATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAA 
ACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTC 
ATAACAAAAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGA 
CCTCCTCAAGGCAGTCAGACTC^TCAAGTTTCTCTATCAAAGCAACCCACCTCCCAA 
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TCCCGAGGGGACCCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCAT 
TAA 

Protein sequence-(Seq~ID.-No—l 1) 

MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFITKALGISYGRKKRRQRRR 
PPQGSQTHQVSLSKQPTSQSRGDPTGPKETSGHHHHHH. 



=> Nef- Tat-HIS 

DNA sequence (Seq. ID. No. 12) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGA 
ATGAGACGAGCTGAGC CAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGAC CTGGAA 
AAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGG 
CTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTA 
AGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGG 
GGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATC 
TACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTC 
AGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAG 
GTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCAT 
GGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCA 
TTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTC^AGAACTGCACTAGTGAG 
CCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCT 
TGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACA 
AAAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCT 
CAAGGCAGTCAGACTGATCAAGTTTCTCTATCAAAGCAACC 

GGGGACCCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCATTAA 
Protein sequencefSeq. ID. No. 13) 



MGGKWS KS SWGWPTVRERMRRAE P AADGVGAAS RDLEKHGAI TS SNTAATNAACAW 
LEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWI 
YHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLH 
GMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTA 
CTNCTCKKCCFHCQVCFITKJUjGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSR 
GDPTGPKETSGHHHHHH. 



E.coli-expressed constructs (fusion constructs) 



=> LipoD-Nef-HIS 
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t>NA sequence (Seq. ID. No. 14) 

Nucleotides corresponding to the Prot D Fusion Partner are in bold. 
The Lipidation Signal Sequence js underlined. After processing ,Jhe cysteine 
coded by the TGT codon, indicated with a star, becomes the amino terminal 
residue which is then modified by covalently bound fatty acids. 

* 

ATGGATCCAAAAACTTTAGCCCTTTCTTTATTAGCAGCTGGCGTACTAGCAGGT TGT 
AGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAAATCATTATT 
GCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCTAAAGCACTT 
GCTTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACTAAGGATGGT 
CGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTTGCGAAAAAA 
TTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTTACCTTAAAA 
GAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGCAAGTGGTCA 
AAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCA 
GCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACA 
AGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAG 
GAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG 
GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 
CACT C CCAACGAAGACAAGATATC CTTGAT CTGTGGATCTAC CAC ACACAAGGCTAC 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTT 
GGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAA 
GGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAG 
AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGA 

GAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGCCACCATCACCATCACCAT 
TAA 

Protein sequence ofthe processed lipidated ProtD-Nef-HIS protein (Seq. ID. No. 15) 
(Amino-acids corresponding to Prot D fusion partner are in bold) 



CSSKS SNMANTQMKS DK 1 1 1 AHRGAS G YLP EHTLE S KALAF AQQ AD YL E QDIiAMTKD 
GRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGGKW 

SKSSWGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQE 
EEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQG 
YFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDP 
EREVLEWRFDSRLAFHHVARELHPEYFKNCTSGHHHHHH. 



=> LipoD-Nef-Tat-HIS 

DNA sequence (Seq. ID. No. 16) 



• " 09/509239 

WO 99/16884 PCT/EP98/06040 

5/17 

Nucleotides corresponding to the Prot D Fusion Partner are in bold. 
The Lipidation Signal Sequence is underlined. After processing, the cysteine 
coded by the TGT codon, indicated with a star, becomes the amino terminal 
_ residue which is then modified by coyalently bound fatty acids. 



* 

ATGGATCCAAAAACTTTAGCCCTTTCTTTATTAGCAGCTGGCGTACTAGCAGGT TGT 
AGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAAATCATTATT 
GCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCTAAAGCACTT 
GCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACTAAGGATGGT 
CGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTTGCGAAAAAA 
TTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTTACCTTAAAA 
GAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGCAAGTGGTCA 
AAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCA 
GCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACA 
AGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAG 
GAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG 
GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 
CACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTT 
GGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAA 
GGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAG 
AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGA 
GAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTA 
GAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGT 
AAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAAAAGCCTTAGGCATCTCC 
TATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGACTCAT 
CAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCCG 
AAGGAAACTAGTGGC CAC CAT CAC CAT C ACCATTAA 



Protein sequence of the processed lipidated ProtD-NEF-TAT-HIS protein (Seq. ID. No. 
ill 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

CSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQQADYLEQDLAMTKD 
GRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGGKW 

SKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSN^ 

EEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQG 
YFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDP 
EREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTACTNCY 
CKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSRGDPTG 
PKETSGHHHHHH . 
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=> ProtD-Nef -HIS 
^ DNA sequence (Seq, ID. No. 18) _ 

Nucleotides corresponding to the Prot D Fusion Partner are in bold. 

ATGGATCCAAGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAA 
ATCATTATTGCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCT 
AAAGCACTTGCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACT 
AAGGATGGTCGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTT 
GCGAAAAAATTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTT 
ACCTTAAAAGAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGC 
AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGA 
GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGA 
GCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCA 
CAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATG 
ACTTACAAGG CAG CTGTAGAT CTTAG CCACTTTTTAAAAGAAAAGGGGGGACTGGAA 
GGGCTAATTCAGTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACA 
CAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCA 
CTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAG 
GCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGAT 
GACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCAC 

GTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGCCACCATCAC 
CATCACCATTAA 



Protein sequence (Seq. ID. No. 19) 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

MDPSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQQADYL 

EQDLAMTKDGRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLK 

EIQSLEMTENFETMGGKWSKSSVVGWPTVRERMRRAEPAADGVGAASRDL 

EKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSH 

FLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGW 

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFH 

HVARELHPEYFKNCTSGHHHHHH . 



=> ProtD-Nef -Tat-HIS 

DNA sequence (Seq. ID. No. 20) 
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Nucleotides corresponding to the Prot D Fusion Partner are in bold. 

ATGGATCCAAGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAA 
ATCATTATTGCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCT 
AAAGCACTTGCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACT 
AAGGATGGTCGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTT 
GCGAAAAAATTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTT 
ACCTTAAAAGAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGC 
AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGA 
GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGA 
GCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCA 
CAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATG 
ACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAA 
GGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACA 
CAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCA 
CTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAG 
GCCAATAAAGGAGAGAACAC CAGCTTGTTACACC CTGTGAGC CTGCATGGAATGGAT 
GACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCAC 
GTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGAT 
CCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAAT 
TGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAAAAGCCTTA 
GGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGT 
CAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGAGGGGACCCG 
ACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCATTAA 



Protein sequence (Seq. ID. No. 21) 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

MDPSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKAIiAFAQQADYIjEQDIiAMT 
KDGRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGG 

KWSKSSWGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEA 

QEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHT 

QGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMD 

DPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTACTN 

CYCKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSRGDP 
TGPKETSGHHHHHH . 



^> Tat-MUTANT-HIS 



DNA sequence (Seq. ID. No. 22) 



09/509239 



WO 99/1 6884 PCT/EP98/06040 

8/17 



ATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATC 4 0 
CAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTG 8 0 

TAAAAAGTGTTG^TTTCAT^GCCAAGTt 1 20' 

GCTG C CTT AGG CAT CT C CT ATGG C AGG AAG AAG CGGAGAC 16 0 

AG CGACGAAG AC CT C CT CAAGG C AGT CAGACT C AT C AAGT 2 0 0 

T T CT CT AT C AAAGC AAC C C AC CT C C C AAT C C AAAGGGGAG 2 4 0 

CCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATC 2 8 0 

ACCATTAA 2 8 8 



Protein sequence(Seq. ID. No. 23) 

Mutated amino-acids in Tat sequences are in bold. 

MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFIT 4 0 
^ AAliGI S YGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGE 8 0 

5 PTGPKETSGHHHHHH . 95 



: z^z 
i - i 



=>Nef- Tat-Mutant-HlS 

DNA sequence(Seq. ID. No. 24) 

ATGGGTGG CAAGTGGT CAAAAAGT AGTGTGGTTGGATGG C 4 0 
CTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGC 8 0 
AG ATGGGGTGGG AGC AG CAT CT C G AG AC CTGG AAAAAC AT 120 
GGAGCAATCACAAGTAGCAATACAGCAGCTACCAATGCTG 160 
CTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGG 2 00 
TTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACT 24 0 
T ACAAGG C AG CTGT AG ATCTTAG C CACTTTTTAAAAGAAA 2 80 
AGGGGGGACTGGAAGGGCT AATT C ACT C C CAACGAAGAC A 320 
AGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 3 60 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCA 40 0 
GATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACC 44 0 
AGTT GAG C C AG AT AAGGTAGAAGAGG CC AAT AAAGGAG AG 48 0 
AACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGG 52 0 
ATGAC C CT G AGAGAGAAGTGT T AG AGTGG AGGTTT G ACAG 560 
CCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCG 600 
GAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTA 64 0 
G ACT AG AG C C CTGG AAG CAT C C AGGAAGT C AG C CT AAAA C 6 8 0 
TGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCAT 720 
TGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCT 76 0 
ATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCA 8 00 
AGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCC 840 
ACCTCCCAATCCAAAGGGGAGCCGACAGGCCCGAAGGAAA 880 
CTAGTGGCCACCATCACCATCACCATTAA 9 0 9 
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Protein sequence (Seq. ID. No. 25) 

Mutated amino-acids in Tat sequence are in bold. 

MGGKWSKSSVVGWPTVRERMRRAEPAADGVGAASRDLEKH 4 0 ~ 
GAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVPLRPMT 8 0 
YKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGY 12 0 
FPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGE 16 0 
NTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHP 2 0 0 
EYFKNCTSEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFH 24 0 
CQVCFITAALGISYGRKKRRQRRRPPQGSQTHQVSLSKQP 28 0 

TSQSKGEPTGPKETSGHHHHHH . 3 02 
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Fig . 5 SDS-PAGE: Nef-Tat-his fusion protein 




Coomassie blue G250 



1: MW (175/83/62,5/47,5/32,5/25/16,5/6,5 kDa) 

2: TNH/23 SP eluate (4 ug) 

3: TNH/23 Superdex200 eluate (4 ug) 

4: TNH/23 Purified bulk (4 ug) 

5: TNH/22 Purified bulk (4 ug) 



6: TNH/23 Purified bulk (4 ug) / non reducing conditions 
7: TNH/22 Purified bulk (4 |jg) / non reducing conditions 
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Fig. 8 Cell binding assay 




protein concentration (pg/ml) 
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F i g - 9 Inhibition of eeii growth 
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